Study on Identification of Subjective Sentences in Product Reviews Based on Weekly Supervised Topic Model

نویسنده

  • Wei Jiang
چکیده

Sentiment analysis or opinion mining in online product reviews is a method that can automatically detect subjective information regarding the entity such as opinions, attitudes, and feelings expressed by consumers. Online product reviews always include objective and subjective sentences; identification of subjective sentences in the given content is a very important and foundational task in the research of opinion mining. In this paper, we focus on the problem of identification of sentence-level subjective sentences, propose a weakly supervised model mixed topics based on LDA for identification of the subjective sentences, considering the impact of multiple topic factors on the identification of subjective sentences. The approach exploits semi-supervised learning method, and extended the existing basic LDA topic model for the identification of subjectivity in text. This work iterates the model prior probability by using a small domain-independent lexicon. Finally, the proposed model is applied to a online review corpus and the experimental shows that the proposed model can effectively improve the recognition effect.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Subjectivity Classification using Machine Learning Techniques for Mining Feature-Opinion Pairs from Web Opinion Sources

Due to flourish of the Web 2.0, web opinion sources are rapidly emerging containing precious information useful for both customers and manufactures. Recently, feature based opinion mining techniques are gaining momentum in which customer reviews are processed automatically for mining product features and user opinions expressed over them. However, customer reviews may contain both opinionated a...

متن کامل

ICWSM - A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews

Sarcasm is a sophisticated form of speech act widely used in online communities. Automatic recognition of sarcasm is, however, a novel task. Sarcasm recognition could contribute to the performance of review summarization and ranking systems. This paper presents SASI, a novel Semi-supervised Algorithm for Sarcasm Identification that recognizes sarcastic sentences in product reviews. SASI has two...

متن کامل

Identifying and classifying subjective claims

To understand the subjective documents, for example, public comments on the government’s proposed regulation, opinion identification and classification is required. Rather than classifying documents or sentences into binary polarities as in much previous work, we identify the main claim or assertion of the writer and classify it into the predefined classes of opinion (attitude) over the topic. ...

متن کامل

Using Topic Sentiment Sentences to Recognize Sentiment Polarity in Chinese Reviews

An approach to recognizing sentiment polarity in Chinese reviews based on topic sentiment sentences is presented. Considering the features of Chinese reviews, we firstly identify the topic of a review using an n-gram matching approach. To extract candidate topic sentiment sentences, we compute the semantic similarity between a given sentence and the ascertained topic and meanwhile determine whe...

متن کامل

The CLSA Model: A Novel Framework for Concept-Level Sentiment Analysis

Hitherto, sentiment analysis has been mainly based on algorithms relying on the textual representation of online reviews and microblogging posts. Such algorithms are very good at retrieving texts, splitting them into parts, checking the spelling, and counting their words. But when it comes to interpreting sentences and extracting opinionated information, their capabilities are known to be very ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JSW

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2014